Accurate Prediction of Immunogenic T-Cell Epitopes from Epitope Sequences Using the Genetic Algorithm-Based Ensemble Learning

نویسندگان

  • Wen Zhang
  • Yanqing Niu
  • Hua Zou
  • Longqiang Luo
  • Qianchao Liu
  • Weijian Wu
  • Francesco Pappalardo
چکیده

BACKGROUND T-cell epitopes play the important role in T-cell immune response, and they are critical components in the epitope-based vaccine design. Immunogenicity is the ability to trigger an immune response. The accurate prediction of immunogenic T-cell epitopes is significant for designing useful vaccines and understanding the immune system. METHODS In this paper, we attempt to differentiate immunogenic epitopes from non-immunogenic epitopes based on their primary structures. First of all, we explore a variety of sequence-derived features, and analyze their relationship with epitope immunogenicity. To effectively utilize various features, a genetic algorithm (GA)-based ensemble method is proposed to determine the optimal feature subset and develop the high-accuracy ensemble model. In the GA optimization, a chromosome is to represent a feature subset in the search space. For each feature subset, the selected features are utilized to construct the base predictors, and an ensemble model is developed by taking the average of outputs from base predictors. The objective of GA is to search for the optimal feature subset, which leads to the ensemble model with the best cross validation AUC (area under ROC curve) on the training set. RESULTS Two datasets named 'IMMA2' and 'PAAQD' are adopted as the benchmark datasets. Compared with the state-of-the-art methods POPI, POPISK, PAAQD and our previous method, the GA-based ensemble method produces much better performances, achieving the AUC score of 0.846 on IMMA2 dataset and the AUC score of 0.829 on PAAQD dataset. The statistical analysis demonstrates the performance improvements of GA-based ensemble method are statistically significant. CONCLUSIONS The proposed method is a promising tool for predicting the immunogenic epitopes. The source codes and datasets are available in S1 File.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

In Silico Perspectives on the Prediction of the PLP’s Epitopes involved in Multiple Sclerosis

Background: Multiple sclerosis (MS) is the most common autoimmune disease of the central nervous system (CNS). The main cause of the MS is yet to be revealed, but the most probable theory is based on the molecular mimicry that concludes some infections in the activation of T cells against brain auto-antigens that initiate the disease cascade.Objectives: The Purpose of this research is the...

متن کامل

Prediction of T-cell epitopes for designing a reverse vaccine against streptococcal bacteria

Streptococcal bacteria are among dangerous human pathogens with major prevalence worldwide. A good vaccine against streptococcal bacteria should have epitopes that confer protection from infection by different streptococcal bacteria types. we aimed was to recognize the most immunogenic and conserved epitopes of streptococcal bacteria, which could be a potential candidate for vaccine development...

متن کامل

B and T-Cell Epitope Prediction of the OMP25 Antigen for Developing Brucella melitensis Vaccines for Sheep

Brucellosis, produced by Brucella species, is a disease that causes severe economic losses for livestock farms worldwide Due to serious economic and medical consequences of this disease, many efforts have been made to prevent the infection through the use of recombinant vaccines based on Brucella outer membrane protein (OMP) antigens. In the present study, a wide range of on-line prediction sof...

متن کامل

In Silico Prediction of B-Cell and T-Cell Epitopes of Protective Antigen of Bacillus anthracis in Development of Vaccines Against Anthrax

Protective antigen (PA), a subunit of anthrax toxin from Bacillus anthracis, is known as a dominant component in subunit vaccines in protection against anthrax. In order to avoid the side effects of live attenuated and killed organisms, the use of linear neutralizing epitopes of PA is recommended in order to design recombinant vaccines. The present study is aimed at determining the dominant epi...

متن کامل

Compatibility of B-Sheets with Epitopes Predicted by Immunoinformatic in Human IgG

Background & Aims: Antibodies, well-known as immunoglobulins (Igs), are produced by B lymphocytes and specifically defend against pathogens. Igs are glycoproteins and have high diagnostic value in several diseases including infections (1). Igs are composed of light and heavy chains (2, 3). Each chain is comprised of about 110-120 amino acid residues which create immunoglobulin folds named domai...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره 10  شماره 

صفحات  -

تاریخ انتشار 2015